Web Survey Bibliography
With more than 250 million active users [1], Facebook (FB) is currently one of the most important online social networks. Our goal in this paper is to obtain a representative (unbiased) sample of Facebook users by crawling its social graph. In this quest, we consider and implement several candidate techniques. Two approaches that are found to perform well are the Metropolis-Hasting random walk (MHRW) and a re-weighted random walk (RWRW). Both have pros and cons, which we demonstrate through a comparison to each other as well as to the ”ground-truth” (UNI - obtained through true uniform sampling of FB userIDs). In contrast, the traditional BreadthFirst-Search and Random Walk (without re-weighting) perform quite poorly, producing substantially biased results. In addition to offline performance assessment, we introduce online formal convergence diagnostics to assess sample quality during the data collection process. We show how these can be used to effectively determine when a random walk sample is of adequate size and quality for subsequent use (i.e., when it is safe to cease sampling). Using these methods, we collect the first to the best of our knowledge unbiased sample of Facebook. Finally, we use one of our representative datasets, collected through MHRW, to characterize several key properties of Facebook.
Author Homepage (abstract) / (full text)
Web survey bibliography (4086)
- The Future of Internet Research; 2010; Lavrakas, P. J.
- Accounting for the effects of data collection modes in population surveys; 2010; Huang, Y. C., Thompson, M. E., Boudreau, C., Fong, G. T.
- Using administrative data to find the best medium: Examples of mixed sources and mixed modes; 2010; Hartkamp, J., Rutjes, H.
- Broadband adoption and use in America; 2010; Horrigan, J.
- Applied survey data analysis; 2010; Heeringa, S. G., West, B. T., Berglund, P.
- Applied missing data analysis; 2010; Enders, C. K.
- Application of a check-all-that-apply question for the evaluation of strawberry cultivars from a breeding...; 2010; Lado, J., Vicente, E., Manzzioni, A., Ares, G.
- A framework for understanding and applying ethical principles in network and security research; 2010; Kenneally, E., Bailey, M., Maughan, D.
- Organizational Survey of Workplace Climate: Differences in Representation Across Response Modes; 2010; Mohr, D., Osatuke, K., Moore, S., Yanovsky, B., Brassell, T., Nagy, M.
- Strategies for High Response Rates Among Hard-to-Reach Respondents: A Case Study From the Communities...; 2010; Fox, L., Mulvey, C., Yamaguchi, R., Levin, M.
- Innovative mobile research in developing countries; 2010; Bellity, E.
- Mobile location based research: Cross cultural examination of coffee culture; 2010; Morden, M., Ferneyhough, C., Grenville, A.
- Online research….and all that Jazz!; 2010; Gittelman, S. H., Trimarchi, E.
- Why are we trying to create new communities for market research purposes?; 2010; Pearson, C., Kateley, V.
- Maximizing online respondent engagement through a game-way research design; 2010; Swahar, G., Swahar, J.
- Designing questions for mixed mode data collection: What have we learnt so far?; 2010; Nicolaas, G., Campanelli, P.
- Online panel survey, Change and stability of political attitudes; 2010
- The Internet, Electoral Politics and Citizen Participation in Global Perspective; 2010; Gibson, R., Cantijoch, M.
- Internet-Based Measurement With Visual Analogue Scales: An Experimental Investigation; 2010; Funke, F.
- Continuity and Innovation in the Design of Understanding Society: the UK Household Longitudinal Study...; 2010; Laurie, H.
- Weighting Strategy for Understanding Society; 2010; Lynn, P., Kaminska, O.
- Globalpark Annual Market Research Software Survey 2009; 2010; Macer, T., Wilson, S.
- Understanding Society Innovation Panel Wave 2: Results from Methodological Experiments ; 2010; Burton, J., Laurie, H., Uhrig, S. C. N.
- Offering a Web Option in a Mail Survey of Young Adults: Impact on Survey Quality; 2010; Turner, S., Viera Jr., L., Marsh, S. M.
- Using Web-Hosted Surveys to Obtain Responses from Extension Clients: A Cautionary Tale.; 2010; Israel, G. D.
- Mobile Experience Sampling: Reaching the Parts of Facebook Other Methods Cannot Reach; 2010; Abdesslem, F. B., Parris, I., Henderson, T.
- Investigating Data Quality in Cell Phone Surveying; 2010; Lavrakas, P. J., Tompson, T., Benford, R.
- Walking in Facebook: A Case Study of Unbiased Sampling of OSNs; 2010; Gjoka, M., Kurant, M., Butts, C. T., Markopoulou, A.
- Social Networking Sites: Evaluating and Investigating their use in Academic Research; 2010; Redmond, F.
- Update on the ARF’s Quality Enhancement Process (QeP); 2010; Pettit, R.
- Elastic-R, a Google docs-like portal for data analysis in the Cloud ; 2010; Chine, K.
- Restructuring and innovations on the survey “capacity of collective tourist accommodation”...; 2010; Santoro, M. T., Staffieri, S.
- Managing the knowledge base - the DUVA system, from data entry to output tools; 2010; Then, R., Bangert, D.
- An Analyze of the Zero Price Effect on Online Business Performance - An Research Based on the Mobile...; 2010; Liu, Y., Yuan, P.
- Is there a future for “real” qualitative market research interviewing in the digital age...; 2010; McPhee, N.
- From clipboards to online research communities; 2010; Poynter, R., Cierpicki, S., Lorch, J., Zuo, B., Davis, C., Eddy, C.
- 3 screen measurement: Soccer World Cup 2010; 2010; Conry, S., Benezra, K., Singh, S.
- Dealing with Nonresponse in Survey Sampling: an Item Response Modeling Approach; 2010; Matei, A.
- Power, sample size, and optimal designs in social research; 2010; Moerbeek, M., van Breukelen, G. J. P.
- Codebook and explanatory note on the WageIndicator dataset ; 2010; Tijdens, K., van Zijl, S., Hughie-Williams, M., van Klaveren, M., Steinmetz, S.
- Modeling non-sampling errors and participation in Web surveys; 2010; Biffignandi, S.
- Perspectives on Web Survey Development: Views from Programmers, Content Specialists, and Survey Methodologists...; 2010; Downey, K.
- Using a Mixed-Mode Design to Survey Ethnic Minorities?; 2010; Feskens, R., Kappelhof, J.
- Blogosphere and Democracy in Portugal–Results of a Websurvey; 2010; Carvalho, T., Casanova, J. L.
- Recent Findings on Using Rich Media in Online Surveys; 2010; Malinoff, B., Henning, J.
- Digital, Social Moms: Using Social Media to Increase Respondent Engagement and Decrease Recruiting Costs...; 2010; Stemberg, C., Rimmer, L., Weinstein, D.
- From Buzz to Biz: Social Media Research for Results; 2010; Pettit, F. A.
- Archiving and Re-using Qualitative and Qualitative Longitudinal Data in Slovenia; 2010; Stebe, J., Hudales, J., Kragelj, B.
- Establishing a Qualitative Data Archive in Austria; 2010; Smioski, A.
- Methodological and Ethical Dilemmas of Archiving Qualitative Data; 2010; Kuula, A.